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Abstract This article studies a biased version of the naming game in which players located on 
(f) . a connected graph interact through successive conversations to bootstrap a common name for 

a given object. Initially, all the players use the same word B except for one bilingual individual 
' who also uses word A. Both words are attributed a fitness, which measures how often players 

speak depending on the words they use and how often each word is pronounced by bilingual 
individuals. The limiting behavior depends on a single parameter: <f> = the ratio of the fitness 
, of word A to the fitness of word B. The main objective is to determine whether word A can 

invade the system and become the new linguistic convention. In the mean-field approximation, 
invasion of word A is successful if and only if <j> > 3, a result that we also prove for the process 
on complete graphs relying on the optimal stopping theorem for supermartingales and random 
walk estimates. In contrast, for the process on the one-dimensional lattice, word A can invade 
the system whenever <j> > 1.053 indicating that the probability of invasion and the critical 
Qh . value for <j> strongly depend on the degree of the graph. The system on regular lattices in 

^ C"| higher dimensions is also studied by comparing the process with percolation models. 

1. Introduction 

The naming game was first proposed by Stells [7] to describe the emergence of conventions and 
shared lexicons in a population of individuals interacting through successive conversations, but a 
■ number of variants of the model have also been introduced and studied numerically by statistical 

physicists, and we refer to Section V.B of [2] for a review of these different variants. The reason for 
the popularity of the naming game in the physics literature is that it is similar mathematically to 
traditional models in the field of statistical mechanics. The model studied in this paper is a biased 
version of the spatial naming game considered by Baronchelli et al. [TJ. Their system consists of 
a population of individuals located on the vertex set of a finite connected graph that has to be 
thought of as an interaction network. Each individual is characterized by an internal inventory of 
words that are synonyms describing the same object. All inventories are initially empty and evolve 
through successive conversations: at each time step, an edge of the network is chosen uniformly 
at random, which causes the two individuals connected by the edge to interact. One individual is 
chosen at random to be the "speaker" making the other individual the "hearer" . If the speaker does 
not have any word to describe the object then she invents one, whereas if she already has some 
words then she chooses one at random to be passed to the hearer. The conversation results in the 
following alternative: if the hearer already has the word pronounced in her internal inventory then 
this word is selected as the norm by both individuals - all the other words are removed from both 
inventories - otherwise the hearer adds the word pronounced to her inventory. 

Based on numerical simulations, Baronchelli et al. pQ studied the maximum number of words 
present in the system as well as the time to global consensus, i.e., the time until all inventories 
consist of the same single word. In contrast, we use the naming game to study whether a new 
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word can spread into a population that is already using another word as a convention, i.e., we 
assume that initially all the inventories reduce to the same single word, say word B, except for one 
individual who also has another word in her inventory, say word A. Under the symmetric rules of 
the naming game, the probability that A becomes eventually the new convention tends to zero as 
the population size goes up to infinity so we look at biased versions of the naming game in which 
each word is attributed a fitness. In our model, the fitness of each word measures the fitness of each 
individual, that is how likely they are selected as a speaker rather than hearer, and also how likely 
each word is selected to be pronounced by bilingual individuals, i.e., individuals who possess both 
words in their internal inventory. Another significant difference between this article and previous 
works about the naming game is that it provides a rigorous analysis of the model on both finite and 
infinite graphs rather than results based on numerical simulations which are unavoidably restricted 
to finite graphs. Also, we describe the dynamics in continuous time rather than discrete time, i.e, 
we assume that conversations occur at rate one along each edge of the graph, in order to have a 
model well defined on finite and infinite graphs. 

To describe our biased version of the naming game more formally, we let 4>a and <pB denote the 
fitness of word A and word B, respectively, and set 

<Pab ■= (1/2) (4>a + 4>b) and px^Y '■= 4>x (<Px + ^y)" 1 
for all X, Y G {A,B,AB}. Note in particular that 

Px^x = 1/2 and p x ^Y + Py^x = 1- 

The average fitness <j>AB represents the fitness of bilingual individuals. In each interaction, the 
individual playing the role of the speaker is chosen at random with probability her fitness divided 
by the overall fitness of the pair: when the neighbors are in state X and Y, the individual chosen to 
be the speaker is the individual in state X with probability px^Y- Similarly, given that a bilingual 
individual is chosen as the speaker, the conditional probability that word A is pronounced is equal 
to the relative fitness pa^b- In particular, each edge becomes active at rate one, which results in 
the following possible transitions for the states at the vertices connected by the edge: 



(A,B) 


* (A, AB) 


with 


probability 


PA^B 




* (AB,B) 


with 


probability 


PB^A 


(A,AB) 


* (A, A) 


with 
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PA^AB + PAB^A PA^B 




■> (AB,AB) 


with 


probability 


PAB^A PB^A 


(B, AB) 


+ (B,B) 


with 


probability 


PB->AB + PAB^B PB^A 




■> (AB,AB) 


with 


probability 


PAB^B PA^B 


(AB, AB) - 


+ (A, A) 


with 


probability 


PA^B 




* (B,B) 


with 


probability 


PB^rA 



(1) 



Note that, when the fitnesses are equal, one recovers the transition probabilities of the unbiased 
naming game described above. We formulate the dynamics using two parameters to have natural 
notations that preserve the symmetry between both words, but we point out that the long-term 
behavior of the process only depends on the ratio 4> := c^a/^b- 




Mean-field model. Before stating our results for the spatial stochastic model, we look at its 
nonspatial deterministic mean-field approximation, i.e, the model obtained by assuming that the 
population is well- mixing. This results in the following system of differential equations: 

u' A = UaUab(1 -2PAB-+APB->a) ~ U A U B PB^A + 2u 2 AB PA^B 
u' B = U B UAB (1 - 2pAB^B PA^B) ~ U A U B PA^B + 2 U 2 AB PB^A 

u'ab = -(ua + u b )' 

where ux denotes the frequency of type X individuals for X € {A, B, AB}. The mean-field model 
has two trivial equilibria, namely 

e A := (1,0,0) and e B := (0,1,0) 

which correspond to the configuration in which all individuals are of type A and the configuration 
in which all individuals are of type B, respectively. We say that word A can invade word B in the 
mean-field model whenever the system starting from any initial state different from e b converges to 
the trivial equilibrium eA- Regardless of the ratio <fi := 4>a/4>b, the frequency of type A individuals 
might decrease because the boundary uab = is repelling, but looking instead at the difference 
between the frequency of individuals using word A and word B gives 

(ua ~ U B )' = UA UAB (1 - %PAB-+A PB^a) 

- U B UAB (1 - 2p A B^B PA^b) + {u A U B + 2u 2 AB )(p A ^B ~ PB^a) 

= (3</»-l)(3(/) + l)- 1 nAnAB 



+ ((/)- 3)^ + 3)- 1 ubuab + {4>~ 1)(0 + I) -1 (u A u B + 2u 



2 ) 
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which is positive for all <j) > 3 when ua ^ 1 and ub ^ 1- This implies that there is no equilibrium 
other than the two trivial equilibria and that word A can invade word B for all (p > 3. This 
condition is sharp in the sense that e# is locally stable when cf) < 3. Indeed, the Jacobian matrix 
of the system of differential equations at point &b reduces to 



<'B 



/ -PB^A 
~PA->B 
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where qA '■= Pab^b PA^B is a key quantity that will appear again later. The eigenspace associated 
with the eigenvalue zero is generated by the vector (0, 1,0) which is not oriented in the direction 
of the two-simplex containing the solution curves. The other two eigenvalues are 

-PB^A = < and 2q A -l = (0 - 3)(0 + 3)" 1 

which are both negative when <p < 3. In particular, for all <p < 3, the equilibrium eg is locally 
stable, therefore word A cannot invade. Note that the obvious symmetry of the model also implies 
that both trivial equilibria are locally stable when 1/3 < <p < 3. Numerical simulations of the 
mean-field model suggest that, in this case, there is an additional nontrivial fixed point which is a 
saddle point, therefore the system is bistable: for almost all initial conditions, the system converges 
to one of the two trivial equilibria (see Figure Q] for pictures of the solution curves). 

Spatial stochastic model. We now look at the spatial stochastic naming game ([1]). For the 

stochastic process, the main objective is to study the probability that word A invades the population 
and is selected as a new linguistic convention when starting with a single bilingual individual and all 
the other individuals of type B. Note that, for non-homogeneous graphs, this probability depends 
on the location of the initial bilingual individual. Also, letting r]t{x) be the state of the individual 
at vertex x at time t, and letting P x denote the law of the process starting with 

rjo(x) = AB and %(y) = B for all y G V, y ^ x 

we define the probability of invasion as 

PA ■■= infxgv Px (hm^oo r) t (y) = A for all y £ V). (2) 

Interestingly, our results indicate that the probability of invasion strongly depends on the topology 
of the network of interactions, suggesting that, on regular graphs, it is decreasing with respect to 
the degree of the network, a property that cannot be captured by the mean-field model since it 
excludes any spatial structure. To begin with, we look at finite graphs. Our first theorem extends 
the first result found for the mean-field model: word A can invade word B for all ^ > 3. 

Theorem 1 - Assume that G is finite and (ft > 3. Then, > 1 — 3/0 > 0. 

Note that on finite graphs pa is always positive but might vanish to zero as the population size 
increases. In contrast, Theorem [1] shows more particularly that the probability of invasion is bounded 
from below by a constant that depends on the ratio (j) but not on the number of vertices. The idea 
of the proof is to show first that a certain function of the number of type A individuals and the 
number of type B individuals is a supermartingale with respect to the c-algebra generated by the 
process and then apply the optimal stopping theorem. Our next result indicates that the invadability 
condition in Theorem Q] is sharp for complete graphs in the sense that the probability of invasion 
vanishes to zero as the population increases when <j> < 3. 

Theorem 2 - Assume that G is the complete graph with N vertices. Then, 

lim pa = for all (j) < 3. 
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In the proof of Theorem HJ the dynamics of the number of type A and type B is expressed as a 
function of the number of edges of different types. The complete graph is the only graph for which 
the number of edges of different types can be expressed as a function of the number of individuals 
of different types. Also, one of the keys to proving the theorem is to use the fact that, on complete 
graphs, the number of individuals in different states becomes a Markov chain. The combination of 
both theorems indicates that the dynamics of the naming game on complete graphs is well captured 
by the mean-field approximation. Our next result shows more interestingly that this is not true for 
the process on the infinite one-dimensional lattice, suggesting that the critical value for the ratio 
of the fitnesses decreases as the degree of the graph decreases. 

Theorem 3 - In one dimension, pa > whenever (f> > c where 



6o -t- v uuy / o 

c := « 1.053 satisfies 48c - 23c - 29 = 0. 

96 

The proof of Theorem [3] is based on the analysis of the interface between individuals in different 
states, which is only possible in one dimension. The bound c is not sharp but our approach to 
prove the theorem together with the obvious symmetry of the model implies that the critical 
ratio is between c" 1 and c, which suggests that the critical ratio is equal to one: the probability 
of a successful invasion is positive if and only if > 1. Finally, we look at the naming game 
on regular lattices in higher dimensions. In this case, using a block construction to compare the 
process properly rescaled in space and time with oriented site percolation, it can be proved that 
the probability of invasion is positive for (f> sufficiently large. 

Theorem 4 - In any dimension, pa > whenever (ft is large enough. 

Our approach can be improved to get an explicit bound for the critical value for <ft but this bound 
is far from being optimal. We conjecture as in one dimension that the critical ratio is equal to one, 
which is supported by numerical simulations of the process. More generally, we conjecture that, on 
connected graphs in which the degree is uniformly bounded by a fixed constant K, the critical value 
is equal to one in the sense that the probability of invasion is bounded from below by a positive 
constant that only depends on K, in disagreement with the mean-field model. 



2. Preliminary results 

In this section, we state some basic properties about the naming game that will be useful in the 
subsequent sections. A common aspect of all our proofs is to think of the process as being con- 
structed graphically from independent Poisson processes that indicate the time of the interactions, 
a popular idea in the field of interacting particle systems due to Harris [5]. In the case of the naming 
game, additional collections of uniform random variables must be introduced to also indicate the 
outcome of each interaction. More precisely, for every edge (x, y) G E, we let 

• {T n (x,y) :n> 1} be the arrival times of a rate one Poisson process, and 

• {U n (x,y) : n > 1} be independent uniform random variables over (0, 1). 

Collections of random variables attached to different edges are also independent. The process is 
then constructed as follows: at time T n (x,y), the states at x and y are simultaneously updated 
according to the transitions in the left column of Table [TJ Since interactions involving both words 
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transitions for (Ct) 


condition on U n (x,y) 


possible transitions for (£t) 


1A 


/ A A \ /A A \ 

(A, A) -> {A, A) 


none 


any 


2A 


(A,AB)^(A,A) 


U n {x,y) <l-q B 


2A, 3A, 3B, 4A, 4B, 5A, 5B, 6B 


2B 


1 A A 7~) \ / * T~1 A T~t\ 

(A,AB) — > (AB,AB) 


U„{x, y)> 1 — q B 


2B, 3B, 4B, 5A, 5B, 6B (excludes 3A, 4A) 


3A 


(A,B)->(A,AB) 


U n {x,y) < PA-fB 


3A, 5A, 5B, 6B 


3B 


(A,B) (AB,B) 


U„(x, y) > PA^B 


3B, 5B, 6B (excludes 5 A) 


4A 


(AB, AB) -> (A, A) 


U n (x, y) < PA-+B 


4A, 5A, 5B, 6B 


4B 


(AB, AB) -> (B, B) 


U„(X, y) > PA^rB 


4B, 5B, 6B (excludes 5A) 


5A 


{AB, B) -> {AB, AB) 


U n {x,y) < q A 


5A, 6B 


5B 


(AB,B)^(B,B) 


Uu{x,y) > q A 


5B, 6B 


6B 


(B,B)^(B,B) 


none 


only 6B 



Table 1 

Coupling between the processes ((t) and (£t). 



can each result in two different outcomes depending on whether word A or word B is pronounced, 
the random variable U n (x,y) is used to account for the probability of each outcome as indicated 
by the conditions in the middle column of the table where 

QA ■= PAB^B PA^B and q B := PAB~>A PB^A- (3) 

Note that qA is the probability that word A is pronounced in a conversation involving a bilingual 
individual and a type B individual. One can easily check that the conditions in the table indeed 
produce the desired transition probabilities in ([I]). Based on this graphical representation, processes 
with different parameters or starting from different initial configurations can be coupled to prove 
important monotonicity results. The next lemma shows for instance a certain monotonicity of 
the naming game with respect to its initial configuration, which can be viewed as the analog of 
attractiveness for spin systems. This result will be useful in the proof of Theorem [3l 

Lemma 5 - Let (Ct) and (£t) be two copies of the naming game. Then, 

P(6(a:) =A) <P(( t {x) =A) and P = B) > P (( t {x) = B) 

for all (x, t) G V x (0, oo) provided this holds for all (x, t) G V x {0}. 

Proof. The result follows from a coupling of the two processes that we construct conjointly from 
the same graphical representation. That is, we assume that 

(£0(2) = ^ implies Co(^) = A) and (Co( z ) = B implies £o(z) = B) for all z G V 

and that both processes are constructed from the same Poisson processes and the same collections 
of uniform random variables. The construction given by Harris [5J, which relies on arguments from 
percolation theory, implies that, for any small enough time interval, there exists a partition of the 
vertex set into almost surely finite connected components such that any two vertices in two different 
components do not influence each other in the time interval. Since the number of interactions in 
each component in the time interval is almost surely finite, the result can be proved for each of 
these finite space-time regions by induction. Assume that 



(£t-(z) = A implies Ct-(z) = A) and (Ct- (z) = B implies f t _ (z) = B) for all z G V 
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for some arrival time t := T n (x,y). To prove that the previous relationship between both processes 
is preserved at time t, we observe that the interaction between the individuals at x and y can result 
in ten different transitions depending on the state of both individuals. These transitions are listed 
in the left column of Table [T] and can be divided into two types: 

• the transitions that create an A or remove a B, which are labeled 2A-5A, 

• the transitions that create a B or remove an A, which are labeled 2B-5B. 

As previously mentioned, except for transitions 1A and 6B, every other pair of states for the neigh- 
bors can result in two possible transitions depending on whether word A or word B is pronounced 
during the conversation. The last column of the table indicates that for all possible simultaneous 
updates of both processes, the ordering between both processes is preserved at time t, i.e., 

(£t(z) = A implies ( t (z) = A) and (Q(z) = B implies £ f (z) = B) for all z G V. 

To prove, as indicated in the last column, that a transition 2B in the first process indeed excludes 
the transitions 3A and 4A in the second process, we observe that 

1 - QB = PA^B + PB^A ~ PAB^A PB^A 

= PA^B + PB^A (1 - PAB^a) > PA->B 

which gives the implication 

U n (x,y) > 1 -q B implies that U n (x,y) > pa~>b (4) 

and proves the exclusion of type 3A and 4A transitions. Similarly, 

U n (x, y) > pa^b implies that U n (x, y) > pab^b Pa^b = QA (5) 

showing that the transitions 3B and 4B in the first process exclude transition 5A in the second 
process. As previously mentioned, the lemma follows from the fact that all possible simultaneous 
updates of both processes given in the last column preserve the desired ordering. □ 

3. The naming game on finite graphs 

This section is devoted to the proofs of Theorem [1] and Theorem [2] about the naming game on 
finite connected graphs. The key to proving the first theorem is to show that a certain process 
that depends on the difference between the number of individuals using word A and the number of 
individuals using word B is a supermartingale with respect to the natural filtration of the naming 
game, which allows to directly deduce the theorem from the optimal stopping theorem. To prove 
the second theorem which specializes in the process on complete graphs, the idea is to observe that, 
as long as bilingual individuals do not interact with each other, there is no type A individual in the 
population and the number of bilingual individuals evolves like a subcritical birth and death process 
that goes extinct quickly. Throughout this section, At and Bt denote respectively the number of 
individuals of type A and type B at time t, and we let 



et(X,Y) := number of edges connecting a type A individual 
and a type Y individual at time t 



(6) 
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for all X,Y 6 {A, B, AB}. To motivate our proof of the first theorem and explain the assumption, 
we observe that the transitions labeled 2A-5A in Table [U which are the transitions that increase 
the number of individuals using A or decrease the number of individuals using B, all occur with 
probability at least one half if and only if > 3. As shown in the next lemma, this property can 
be used to construct a certain supermartingale with respect to the natural filtration of the process: 
the (j-algebra Ft generated by the realization of the naming game until time t. 

Lemma 6 - Assume that > 3. Then, for all s > t, 

E (M s | F t ) < M t where M t := a At ~ Bt and a := 3/0. 
Proof. Using the transition probabilities in Table [IJ we get 

lim^o h' 1 E (M t+h - M t | F t ) = E?=- 2 ( ai ~ 1) M t l™/^o h" 1 P (M t+h = M t + j \ F t ) 
= {a -I) M t (e t (A, AB) (1 - q B ) + e t (A, B)p A ^ B + e t (B, AB) q A ) 

+ (a" 1 - 1) M t (e t (B, AB) (1 - q A ) + e t (A, B) p B ^ A + e t (A, AB) q B ) 
+ M t e t (AB,AB) ((a 2 - 1) PA -+b + (« -2 - 1)pb->a). 
Re-arranging the terms with respect to the type of edges, this becomes 

lim^o h~ l E(M t+h -M t \ F t ) = M t e t (A, AB) ((a - 1)(1 - q B ) + (a" 1 - 1) q B ) 

+ M t e t (B, AB) ((a -l)q A + (a" 1 - 1)(1 - «a)) 
+ M t e t (A, B) ((a - 1) p A ^ B + (a^ 1 - 1) p B ^ A ) 
+ M t e t (AB,AB) ((a 2 - l)p A ^ B + (a" 2 - 1)pb-+a). 
First, we observe that q B = (30 + and, for all > 1/3, 

(a - 1)(1 - q B ) + (a" 1 -l)q B = a" 1 (a - 1)((1 - q B ) a - q B ) 

= a' 1 (30 + l)^ 1 (a- l)(30a - 1) < for all (30)" 1 < a < 1. 

Similarly, <^ = (0 + 3)" 1 and, for all > 3, we have 

(a -l)q A + (a- 1 - 1)(1 - q A ) - 1 = a" 1 (a - l)(g A a - (1 - qr A )) 

= a~ 1 (0 + 3)" 1 (a-l)(0a-3) < for all 30" 1 < a < 1. 

Finally, p A ^ B = (0 + l) -1 and, for all > 1, we have 

(a - 1)pa->b + (a" 1 - 1)pb^a = a" 1 (a- l)(p A -> B a- p B ^ A ) 

= a" 1 (0+ l)" 1 (a- l)(0a- 1) < for all 0" 1 < a < 1 

from which we also deduce that, for all > 1, 

(a-l)p A ^ B + (a~ l -l)p B -+A <0 for all l/y/^<a<l. (10) 

Plugging (fT|)- (fT0j) into ©, we conclude that 

lim^o h" 1 E (M t+ h ~ M t \F t ) < for all > 3 and a = 3/0 

showing that (Mt) is a supermartingale for a = 3/0. □ 



(7) 



(8) 



(9) 
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Applying the optimal stopping theorem to (Mt) gives the following result. 

Lemma 7 - For all > 3, we have 

PA '■= inf xe y P x (B t = for some t) > 1 — 3/0 > 0. 

Proof. First, we introduce the stopping time 

T := mf{t:A t -B t £{-N,N}} 

where N denotes the number of vertices. Since the naming game on any finite graph converges 
almost surely to the configuration in which all individuals are monolingual of the same type, the 
stopping time T is almost surely finite. Using in addition that the process (Mt) is a supermartingale 
according to Lemma [HJ we deduce from the optimal stopping theorem that 

E Mt = E(a XT ~ YT ) < a N p A + a~ N (1 - p A ) < E M = a"^" 1 ) 

for all a = 3 /(ft < 1. In particular, 

PA > (a-^- 1 ) - a~ N )(a N - a^)" 1 

= (1-aXl-a 2 *)- 1 > I- a = 1-3/0 > 

which completes the proof of the lemma. □ 



Theorem [T] directly follows from Lemma [7] by observing that the probability pa in the statement 
of the lemma is precisely the probability pa in the statement of the theorem. We now focus on 
the naming game on the complete graph. Note that in this case the number of edges of each type 
can be expressed as a function of the number of individuals of each type, therefore (At, Bt) is now 
a continuous-time Markov chain. As previously mentioned, to prove that pa tends to zero as the 
number of vertices goes to infinity, the idea is to observe that, as long as bilingual individuals 
do not interact with each other, there is no type A individual in the population and the number 
of bilingual individuals evolves like a subcritical birth and death process. To make the argument 
precise, we introduce the birth and death process (Zt) starting with a single individual and with 
birth rate NqA and death rate N(l — q A ), i.e., 

limh_K) h' 1 P (Z t+ h = j\X t = i) = i Nq A for j = i + 1 

= iN(l — q A ) for j = i — 1. 

We start with the following preliminary result about the number of jumps J before extinction of 
subcritical birth and death processes. 

Lemma 8 - Fix <ft < 3 and e > 0, and let J := card {t : Z t ^ Zt-}. Then, 

P ( J > 2n e + 1 1 Z = 1) < e for all n e large. 
Proof. First, we note that, since < 3, 

QA = PAB—^B PA—^B = 777- ——^ = —77 < —-77 = 1 ~ «A 

2 (<pab + 4>b) <P + 3 + 3 
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from which it follows that 

P ( J < oo) = P (Z t = for some t \ Z = 1) = 1. 

Moreover, using that the number of paths of length In not crossing is bounded by the total 
number of paths of length 2n together with Stirling's formula, we get 

P(,/ = 2„ + l) < ( 2 "U (!-„)»+• < ('««('-«*»- 
V n / V 71 " 71 

for all n large. In particular, 

P(J>2n e + l) < P(J = oo) + f (4gA(1 - gA))rt < e 

n=n e v 

for all n e large since 4q^(l — q^) < 1- D 

The reason for introducing the birth and death process above is that the number of bilingual 
individuals evolves precisely according to this process until two bilingual individuals interact with 
each other, an event that we call a collision. In particular, it can be deduced from the previous 
lemma that the probability that a collision ever happens is small when iV is large, which is also a 
bound for the probability that word A outcompetes word B. To prove this result, we let 

tc '■= inf {t : t = T n (x, y) for some x,y € V with r)t-(x) = i]t~(y) = AB}. 

be the time of the first collision. 

Lemma 9 - Fix <ft < 3 and e > 0. Then, 

P (t c < oo | A> = and B = N - 1) < 2e for all N large. 

Proof. To begin with, we observe that, before the time tc of the first collision, there is no 
monolingual individual of type A in the population. In particular, using the expression of the 
transition probabilities in the second column of Table [lj and introducing 

r(i,j) := lim^o h~ l P (A t+h = A t + i and B t+h = B t + j \ P t ), 

we obtain that, before the first collision, 

r(0,-l) = q A e t (B,AB) r(+2,0) = p A ^ B e t (AB,AB) 

r(0,+l) = (l-q A ) e t (B,AB) r(0,+2) = Pb ^a e t (AB, AB) 

whereas r(i,j) = for all other values of i and j. This implies that, before the first collision, 
the number of bilingual individuals has evolved according to the birth and death process in which 
individuals independently give birth at rate Nq A and die at rate N(l — q A ). In particular, the 
naming game can be coupled with the birth and death process in such a way that 

P (A t = and (AB) t = Z t \ r c > t) = 1 
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where (AB)t denotes the number of bilingual individuals at time t. The rest of the proof relies 
on the fact that the probability that the number of jumps in the birth and death process is large 
and the probability that there is a collision in the naming game coupled with the birth and death 
process when the number of jumps is small are both small when the graph is large. Indeed, LemmaO 
gives the existence of n e fixed from now on such that 

P(t c <oo\ J = 2n + 1) P(J = 2n + l) < P(J>2n e + l) < e. (12) 

n>n e 

Moreover, when J = 2n + 1, the maximum number of individuals cannot exceed n + 1 in the birth 
and death process therefore, thinking again of the number of bilingual individuals as being coupled 
with the birth and death process before the first collision, at each jump, the probability of a collision 
is bounded by N~ 1 (n + 1). The integer n e being fixed, this implies that 

^2 P (t C < oo \ J = 2n + 1) P (J = 2n + 1) 

n<n t (I?) 

< ^ P ( T c < oo \ J = 2n + 1) < J2 N ~ (2n + l)(n+l) < e { ' 

n<n e n<n e 

for all N sufficiently large. The lemma simply follows by observing that the probability to be esti- 
mated is bounded by the sum of the probabilities in f)12|) and ()13|) . □ 

Theorem [2] directly follows from the next lemma. 
Lemma 10 - Fix <j) < 3 and e > 0. Then, 

P{r] t = A for some t \ A = and (AB) = 1) < 2e for all N large. 
Proof. Since there is no type A individual before the first collision, 

P(rj t = A for some t \ A = and (AB) = 1) 

< P (rj t (x) = A for some (x, t) G V x R+ | A = and (vl^o = 1) 

< P (t C < oo | A = and (AB) = 1) < 2e 
for all N sufficiently large according to Lemma [9j □ 

4. The naming game in one dimension 

This section is devoted to the proof of Theorem [3j The first and main step of the proof is to show 
almost sure invasion of word A for the naming game (Ct) starting with 

Co (a;) = A for all x < and ( (x) = B for all x > 0. (14) 

The main difficulty to prove this result is that, even in the presence of nearest neighbor interactions, 
the evolution rules in (pQ) can create infinitely many interfaces, i.e., the state space of the process 
seen from the rightmost type A individual that has only type A to her left is infinite. Motivated by 
numerical simulations of the process that suggest that the size of the interface is somewhat small 
most of the time, we prove the result for the process that has 



Zt(X t +j) = B forallj>3 



(15) 
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where 

X t := max {a; £ 7L : £t(y) = A for all y < x} 

but otherwise evolves according to the evolution rules (pQ). That is, the process starts from the con- 
figuration described in (|14p and evolves according to the evolution rules of the naming game except 
that, each time the configuration violates condition (115p . the state at vertex Xt + 3 instantaneously 
flips to a type B. In view of this new rule, Lemma [5] implies that 

P(Zt{x)=A)<P(( t (x)=A) and P = B) > P (( t (x) = B) 

for all x 6 Z and t > 0, from which it follows that 

lim P (C t (x) = A) = 1 for all x G Z 

t— i-OO 

whenever 

lim Xt = oo almost surely. (16) 

t— >oo 

Moreover, one easily checks that the modified process (£t) only has three possible interfaces corre- 
sponding to the following three types of configurations: 

(type 0) &(Xt + j) = B for all j > 1 

(typel) St(X t + l) = AB and &(X t + j)=B for all j > 2 

(type 2) £ t (X t + 1) = £ t (X t + 2) = AB and £ t (X t +j)=B for all j > 3. 

Indeed, only the transitions — > 1 and 1 — > and 1—^2 for the configuration types are allowed 
starting from a type or a type 1 configuration. Moreover, from a type 2 configuration, either 
a monolingual and a bilingual individuals interact, which results in a type 1 configuration or a 
configuration with three bilingual individuals which instantaneously flips to a type 2 configuration, 
or both bilingual individuals interact, which results in a type configuration. The main reason for 
introducing this modified process is its mathematical tractability due to the small size of the state 
space of the process seen from the interface. As previously mentioned, this is further motivated 
by the fact that numerical simulations suggest that the naming game itself, when starting from 
configuration (|14[) . is most of the time in type 0, 1 or 2 configurations, so the analysis of the 
modified process allows to obtain a bound c somewhat close to one. To establish Theorem [31 we 
now prove that, under the conditions of the theorem, (|16j) holds. This is done by first computing 
the occupation time of the modified process in each configuration type and then computing the 
value of the drift for the process (Xt) in each configuration type. To shorten the notations as in 
the proof of Lemma [6l we will again use the probabilities qA and qs defined in (|3|). 

Lemma 11 - The limits ttj := lim^oo P (£f is of type j) exist and satisfy 

ttq = 2 7Ti + (r — 2) 7T2 and r-n\ = (3 — r) TT2 where r:=qA + qB- (17) 

Proof. Let Yj := j if the configuration at time t is of type j. Looking at all the possible updates 
of the modified naming game and the corresponding transition rates in Figure El one easily checks 
that the configuration type evolves according to the Markov chain with transitions 

-> 1 at rate r i = pa^b + Pb^a = 1 

1 ->■ at rate r w = (1 - q B ) + (1 - ia) = 2 - (q A + 3b) 

1 — > 2 at rate T\i = qA + QB (18) 

2 ->■ at rate r 20 = Pa^b + Pb^a = 1 

2 ->■ 1 at rate r 2 i = (1 - qs) + (1 - Qa) = 2 - (q A + qB)- 
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P B ^ A B B B 

A l-q B A Qa 
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type 1 

O -4 • -4 ► • • 

QB i; '/! /> B 

A l-QB A PA-+B A QA 
• -4 ► • -4 ► • ► O 

O -4 • -4 ► • -4 ► • 

IB Q PB^A ft 1-qA ft 

x t 

Figure 2. Picture of the configuration types and all the possible transitions along with their rates. Each configu- 
ration is represented by two copies of the lattice, with the upper layer having a black particle to indicate that the 
individual uses word A and a white particle when she does not, and similarly for word B at the lower layer. Arrows 
indicate transitions where the individual at the tail speaks to the individual at the tip, while double arrows indicate 
transitions where any of the two neighbors speaks to the other one. The two dashed arrows in type 2 configura- 
tions correspond to the two transitions that are instantaneously followed by the event that the rightmost bilingual 
individual spontaneously becomes a type B monolingual individual. 



Note that the rates on the two dashed arrows of Figure [2] are irrelevant. These transition rates 
imply that (Yj) is irreducible therefore the limits 



iTj := lim P (configuration £ t is of type j) = lim P (Y t = j) for j = 0,1,2 

t— >oo t— >oo 

exist and satisfy the following two equations: 

7!"0 = (HO + n.2) TTl + 7*21 7T2 and r i2 7Tl = (1 + r 2 l) 7T 2 . 

Using also that ?r 2 = r and no = r 2 i = 2 — r according to (|18p gives 

ttq = 2 7Ti + (2 — r) 7T2 and r 7Ti = (3 — r) ix<i 
which is precisely (fT7|) . □ 



To prove (|16p . the next step is to compute the value of the conditional drift of the process (Xt) 
given the configuration type, i.e., 

Dj := lim^o h- 1 E (X t+h - X t \Y t = j) for j = 0, 1, 2. 

Looking again at all the possible updates, one easily finds 

Do = -Pb^a 

D 1 = (l-q B )-q B = l~2q B (19) 
D 2 = (1 - q B ) ~ QB + ZPA^B = Di + 2p A ^B- 
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The last step is to combine (|17|) and (I19p to prove that 



7T -Do + tti-Di + vr 2 I? 2 > for all </> > c (20) 

from which the almost sure convergence of the interface to infinity follows. The most natural 
approach is to express iTj and Dj for j = 0, 1, 2 as a function of </>, from which it can be deduced 
that the first inequality in (|2(jp holds for (j) larger than the largest real root of a certain polynomial 
with degree six. This root is not obvious to compute. Instead, we observe that, when both fitnesses 
are close to each other, <j) is close to one and the rate r close to 1/2. The next two lemmas show 
that the left-hand side of (|20p is larger than its counterpart obtained by computing ttj under the 
assumption r = 1/2, which allows to express c more simply as the largest root of a polynomial 
with degree two. Interestingly, a series of evaluations of the polynomial with degree six around c 
indicates that the largest real root of this polynomial only differ from c by less than 10~ 6 , which 
shows a posteriori the advantage of our approach. 

Lemma 12 - For all 4>a,4>b > 0, we have 1/2 < r < 1. 
Proof. Recalling ((TTJ) and using 4>a + 4>B = ^4>AB-, we get 

r := q A + qs 

= PAB^A PB^A + PAB^B PA^B 

= 4>b (2</>a + 2<^4b) -1 + <t>A(2<t>B + 2<Pab)~ 1 

= <M3 0a + <^)- 1 + ^a(3 0b + <M)- 1 = (30 + l)- 1 + (30- 1 + l)- 1 =: h(<f>). 
Noticing that h{(j)) = /i(<^~ 1 ) and differentiating with respect to (j), we deduce that 

h(l) = 1/2 < r < 1 = lim h(<f>), 

</)—>■ oo 

which completes the proof. □ 

Lemma 13 - For all 4>a,4>b > 0, we have 

sgn{Tt Q D {i + K 1 D 1 + n 2 D 2 ) > sgn(17A) + 10D 1 + 2 As). (21) 
Proof. Using the relationship among ^0,^1 and tx^ given in (I17D . we obtain 

sgn (7T -Do + TTl-Dl + 7r 2 -D 2 ) = sgn ((2 iri + (r - 2) ir 2 ) D + 7Ti D\ + 7r 2 D> 2 ) 

= sgn((2 J D + £'i)7ri + ((r-2)Do + £ , 2)7r 2 ) 
= sgn ((2D + £>i)(3 - r) + ((r - 2) + -D 2 ) r). 
To find a lower bound for the sign above, we introduce the function 

D(r) := (2D + D 1 )(3-r) + {(r-2)D + D 2 )r 
and observe that, for all r < 1, 

D'(r) = -(2D + D 1 ) + ((r-2)D + D 2 ) + D r 

= 2{r-2)D G -D 1 + D 2 = -2 (r - 2)(1 - Pa ^b) + 2pA^B 

= -2{r -2) + 2(r -l)p A ^B > -2 (r - 2) + 2 (r - 1) = 2 > 0. 
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Using in addition that 1/2 < r < 1 according to Lemma [12] gives 

sgn(vr OJ Do+vriZ)i+7r 2 Z)2) > sgn(D(l/2)) 

= sgn ((2D + £>i)(3 - 1/2) + ((1/2 - 2) D + D 2 ) (1/2)) 
= sgn(17£> + 10£>i + 2£> 2 ). 
This completes the proof. □ 



Lemma 14 - The right-hand side of (|2ip is positive whenever 

> c where c := ^ + ~ 1.053. (22) 

Proof. First of all, note that 

PB^A = (0+ir 1 QA = 0(0 + 3)^ QB = (30 
Using in addition (fT9|) gives 

(0 + l)(30 + l)A) = -(3^ + 1) 



o 



i 7 ! 
F 2 



+ l)(30 + l)Di = (30-l)(0 + l) 

+ 1)(30 + 1)D 2 = (30-l)(0 + l) + 20(30 + l). 



Since (0 + 1)(30 + 1) > 0, we deduce that 

sgn(17D + 10Di + 2 J D 2 ) = sgn (17 F + WF 1 + 2 F 2 ) 

= sgn (-17 (30 + 1) + 12 (30 - 1)(0 + 1) + 40 (30 + 1)) 
= sgn (480 2 - 230 - 29) 

which is positive whenever > c as defined in ([22]) . □ 

From Lemma [Til it directly follows that the process (Xt) converges almost surely to infinity, which 
also implies convergence of the naming game starting from configuration (|14j) to the configuration 
in which all individuals are type A monolingual. Moreover, we have 

P (X t > for all t) > 0. 

To deduce that word A can invade word B, we let (X t + ) and (Xf) be two independent copies of 
the process (Xt) and use a standard coupling argument to conclude that, under the assumptions 
of the theorem, the probability that the naming game starting with the origin in state A and all 
the other vertices in state B converges to the "all A" configuration is given by 

P (X+ > -X t " for all t) > P (X+ > and - X~ < for all t) 

> P (X+ > for all t) P (-Xf < for all t) 

> P(X t >0 for all t) P(X t >0 for all t) > 0. 



Since there is a positive probability for the process starting with a single bilingual individual at the 
origin that the origin is of type A at time one, this completes the proof of Theorem [3l 
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5. The naming game in higher dimensions 

This section is devoted to proving Theorem [H which relies on a block construction. To spare the 
reader complicated notations, we only prove the result in d = 2 but our approach easily extends to 
higher dimensions. The idea of the block construction is to couple a certain collection of good events 
related to the process properly rescaled in space and time with the set of open sites of oriented site 
percolation on the oriented graph Hi with vertex set 

H := {(z, n) G 1? x Z + : z\ + Z2 + n is even} 

and in which there is an oriented edge 

(z, n) —7- (z' , n ) if and only if z = z + (±1, ±1) and n' = n + 1. 

See the left-hand side of Figure |4] for a picture in d = 1. To rescale the process and define the 
collection of good events later in the proof of Lemma [T5l we let T := and introduce the 
collection of space-time blocks 

B(z,n) := {{x, t) = ((x\, X2), t) G Z 2 x [0, 00) such that 

Xj G {zj, Zj + 1} for j = 1, 2 and t G [2nT, 2(n + 1) T)} for all (z, n) G H. 

In words, space is partitioned into 2x2 squares and time into intervals of length 2T, while the 
collection of space-time blocks in (|23p defines a partition of the space-time universe. The key to 
proving invasion of word A is to show that the set of sites 

(z,n) G H such that rj t {x) = A for all (x,t) G B(z,n), 

that we call A-sites for short, dominates stochastically the set of wet sites in an oriented site perco- 
lation process whose parameter can be made arbitrarily close to one by choosing the parameter (j> 
sufficiently large. More precisely, we have the following lemma. 

Lemma 15 - For all e > 0, there exists <p > such that the set of A-sites dominates the set of 
wet sites in a two dependent oriented site percolation process with parameter 1 — e. 

Proof. We say that the interaction along edge (x,y) at time T n (x,y) is 

a good interaction if U n (x,y) < qA = 0(0 + 3)" 1 

and a bad interaction otherwise. Referring to Figure [3l we let G(z,n) be the event that 

1. between time 2nT and time (2n + 1) T, there are at least two good interactions along each of 
the eight edges labeled 1 on the left-hand side, 

2. between time 2nT and time (2n + 1) T, there is no bad interaction along any of the sixteen 
edges labeled 2 on the left-hand side, 

3. between time (2n + 1) T and time 2(n + 1) T, there is at least one good and no bad interaction 
along each of the eight edges labeled 3 on the right-hand side, 

4. between time (2n + 1)T and time (2n + 4) T, there is no bad interaction along any of the 
sixteen edges labeled 4 on the right-hand side. 
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Figure 3. Picture of the block construction. 

From dU)-© and the probabilities in Table [TJ it follows that an interaction involving at least one 
individual using word A can only result in one of the transitions 1A-5A in the table. In particular, 
whenever site (z, n) is an ^4-site and our good event 1-4 occurs, the following holds: 

• at time (2n + 1) T, all twelve vertices marked with a black dot • on the right-hand side of the 
figure are of type A and 

• between (2n + 2) T and (2n + 4) T, all sixteen vertices in the figure are of type A. 
In particular, letting £l(z, n) be the event that (z,n) is an ^4-site, we deduce that 

n(z,n) n G(z,n) C £l((zi ± 1, z 2 ± l),n + 1). (24) 

Now, let X and Y be the number of good and bad interactions that occur along one given edge in 
a given time interval of length T. Since interactions occur along each edge of the lattice at rate one 
and are independently good with probability <fi (<ft + 3) _1 

X = Poisson(0T(0 + 3)~ 1 ) and Y = Poisson (3T ((f) + 3) _1 ). 

In particular, for all e > 0, the probability of the good event 1—4 is 

P(G(z,n)) > 1 - 8P(X < 2) - 16P(F / 0) 

-8P(X = 0) -8P(Y 7^0) - 16 X 3P(Y / 0) 
= 1 - 16 P (X = 0) - 8 P (X = 1) - 72 P (Y ^ 0) 

= 1 - 8 (2 + <t>T {<j) + 3)" 1 ) exp(-#T (0 + 3)- 1 ) - 72 (1 - exp (-3T (0 + 3)" 1 )) 
> 1 - 8 (2 + <j)T (<j) + 3)" 1 ) exp(-0T (0 + 3)" 1 ) - 216 T (0 + 3)" 1 
= l-8(2 + 0vW + 3) _1 ) exp(-^^(</' + 3)" 1 ) -216^(<A + 3)~ 1 > 1-e 

for all (j) large enough. Finally, we observe that the good event G(z, n) is measurable with respect 
to the graphical representation in the space-time region 

(z,2nT) + {[-2,3] x [0,4T)} C Z 2 x[0,oo). 

This, together with the inclusion (I24p and the lower bound (|25h are exactly the comparison as- 
sumptions of Theorem 4.3 in [3], from which the lemma directly follows. □ 



(25) 
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It is known from standard results based on the so-called contour argument that, for e > small 
enough, there exists with positive probability an infinite cluster of wet sites in the two dependent 
oriented site percolation process on %\ starting with one open site at level and in which sites at 
the other levels are open with probability 1 — e. This, together with Lemma [T5| implies that, for 
the naming game starting with a single bilingual individual, 

liminf P (rj t (x) = A) > for all x G Z 2 . 

t— >oo 

This proves survival of word A but not extinction of word B with positive probability In fact, 
a weaker form of survival can be proved in the more general case when eft > 3 by simply using 
techniques similar to the ones in the proof of Lemma [6] to show that the number of individuals 
using word A is a submartingale. However, extinction of word B with positive probability cannot 
be deduced from this approach. In contrast, our coupling with oriented site percolation combined 
with an idea of the author [6] that extends a result of Durrett [3] can be used to complete the proof 
of the theorem. This is done in the next lemma. 

Lemma 16 - For all 4> large enough we have pa > 0. 

Proof. Throughout the proof, we think of the naming game as being coupled with oriented site 
percolation as in the statement of Lemma [HI To begin with, we follow [6] by introducing the new 
oriented graph %2 with the same vertex set as "Hi but in which there is an oriented edge 

(z, n) — > (z', n') if and only if (z' = z + (±1, ±1) and n' = n + 1) 

or (z' = z + (±2, ±2) and n' = n). 

See the right-hand side of Figure [J] for a picture in d = 1. We say that a site in the percolation 
process is dry if it is not wet. Also, for j = 1, 2, we write (to, 0) —}j (z,n) and say that there is a 
dry path connecting both sites if there is a sequence 

(20,0) = (w,0), (zi,m), (z k ,n k ) = (z,n) G H 

such that the following two conditions hold: 
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1. (zi,m) — > (zi+i, rit+i) is an oriented edge in Hj for all i = 0, 1, . . . , k — 1 and 

2. the site (zj, n^) is dry for all i = 0, 1, . . . , k. 

Note that a dry path in Hi is also a dry path in %2 but the reciprocal is false. Now, the proofs 
of Lemmas 4-11 in Durrett [3] imply the following: there exists e > small such that, for the 
percolation process on Hi with parameter 1 — e starting with (0, 0) open and all the other sites 
closed at level zero, conditioned on the event that percolation occurs, we have 

lim P((w,0) — H (z,n) for some w G 2Z 2 , 
m->oo (26) 

some z G -E>2(0, na) and some n > m) = 

for some a > 0. In words, if the density of open sites is close enough to one, there is a linearly 
expanding region in which (even closed) sites cannot be reached from a path of dry sites starting 
at level zero. This applies to dry paths in the graph Hi but as pointed out in [6], the proofs of 
Lemmas 4-11 in Durrett [3] easily extend to give (126p for dry paths in H.2- To conclude the proof, 
the last step is to show the connection between dry paths and ^4-sites. Assume that 

ri t (x) / A for some x G I? and t G [2nT, 2(n + 1) T). (27) 

Since word B cannot appear spontaneously, this implies the existence of 

xo, xi, . . . , x m = x G 1? and so = < s\ < • • • < s m +i = t 

such that f] s {xj) ^ A for all Sj < s < Sj+i and j = 0, 1, . . . , m, 

which in turn implies that 

(w,0) — >2 ( z , n ) for some w G 21? and (z,n) such that (x,t) G B(z,n). (28) 

Note however that this does not imply the existence of a dry path in Hi which is the reason why 
we introduced a new graph with additional edges. Taking the probability of the event in (|28p and 
the probability of the sub-event in (I27p directly gives 

P (f] t (x) / A) < P ((w, 0) (z, n) for some w G 21?) (29) 

where (z,n) is the unique site such that (x,t) G B(z,n). Since e > can be made arbitrarily small 
by choosing (j> large, the analog of (j26f) for oriented dry paths in the graph H2 together with the 
inequality (|29p implies that, conditioned on the event that percolation occurs, 

lim P (r] t (x) = A) = 1 for all x G I? 

t— >oo 

for the naming game conditioned on the event that (0, 0) is an yl-site. Since the probability that 
percolation occurs is positive for e > small and since there is a positive probability for the process 
starting with a single bilingual individual at the origin that all sites in the spatial box {0, l} d are 
of type A at time one, the lemma and Theorem [J] follow. □ 
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